Pitch estimation in noisy speech based on temporal accumulation of spectrum peaks
نویسندگان
چکیده
In this paper, we present a study on robust pitch estimation by integrating spectral and temporal information in speech. Spectrum harmonics are important representations of the speech fundamental frequency. Harmonic-related spectral peaks of speech evolve much more slowly than the spectral peaks of noise. This motivates the proposition of temporally accumulated peak spectrum (TAPS), which is computed by cumulating spectrum peaks over consecutive analysis frames. In the TAPS, harmonic-related peaks are concentrated around the fundamental frequency and its multiples, while the peaks caused by noise are irregularly distributed with relatively small amplitude. A pitch estimation method is derived based on TAPS. The peak locations on the autocorrelation of TAPS indicate the frequency separations between the harmonic peaks, which are used to estimate the fundamental frequency. The proposed method is evaluated on speech signals corrupted by white noise, speech noise and babble noise. The results of pitch estimation show that our method performs more robustly and reliably than conventional time-domain and cepstrum-domain methods.
منابع مشابه
Multi-band summary correlogram-based pitch detection for noisy speech
A multi-band summary correlogram (MBSC)-based pitch detection algorithm (PDA) is proposed. The PDA performs pitch estimation and voiced/unvoiced (V/UV) detection via novel signal processing schemes that are designed to enhance the MBSC’s peaks at the most likely pitch period. These peak-enhancement schemes include comb-filter channel-weighting to yield each individual subband’s summary correlog...
متن کاملNoise Suppressor using Zero Phase Signal and Accumulated Spectrum Technique
This paper proposes a wide-band noise reduction method using temporal accumulated spectrum and zero phase (ZP) signal. In the previous study, we replace the ZP signal around the origin with the ZP signal in the second or latter period to get an estimated speech ZP signal. For very low SNR environment, reliable period estimation is difficult. This paper presents a study of period estimation in n...
متن کاملNoisy speech enhancement based on long term harmonic model to improve speech intelligibility for hearing impaired listeners
This study proposes a speech enhancement algorithm to improve speech intelligibility for hearing impaired listeners in adverse conditions. The proposed algorithm is based on a long term harmonic model, where the harmonics of target speech are more distinguished from noise spectrum interference. Our method consists of two stages: i) Prominent pitch estimation based on long term harmonic feature ...
متن کاملDOA Estimation with Local-Peak-Weighted CSP
This paper proposes a novel weighting algorithm for Cross-power Spectrum Phase (CSP) analysis to improve the accuracy of direction of arrival (DOA) estimation for beamforming in a noisy environment. Our sound source is a human speaker and the noise is broadband noise in an automobile. The harmonic structures in the human speech spectrum can be used for weighting the CSP analysis, because harmon...
متن کاملPitch estimation of noisy speech signals using empirical mode decomposition
This paper presents a pitch estimation method of noisy speech signal using empirical mode decomposition (EMD). The normalized autocorrelation function (NACF) of the noisy speech signal is decomposed into a finite set of band-limited signals termed as intrinsic mode functions (IMFs) using EMD. The periodicity of one IMF is supposed to be equal to the accurate pitch period. A conventional autocor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010